Recognize tone languages using pitch information on the main vowel of each syllable
نویسندگان
چکیده
An innovative method for speech recognition of tone languages is reported. By definition, the tone of a syllable is determined by the pitch contour of the entire syllable. We propose that the pitch information on the main vowel of a syllable is sufficient to determine the tone of that syllable. Therefore, to recognize tone languages, only main vowels are needed to associate with tones. The number of basic phonetic units required to recognize tone languages is greatly reduced. We then report experimental results on Cantonese and Mandarin. In both cases, using the main vowel method, while the number of phonemes and the quantity of training data are substantially reduced, the decoding accuracy is improved over other methods. Possible applications of the new method to other tone languages, including Thai, Vietnamese, Japanese, Swedish, and Norwegian are discussed.
منابع مشابه
Pitch Accent and Vowel Devoicing in Japanese
Japanese is widely recognized as a prototypical pitch-accent language, based on the fact that, given the “accent” location or the lack thereof, the tonal pattern of the entire word is totally predictable. Therefore, unlike tone languages, specification of the tone of each syllable is unnecessary. Consequently, it has been argued that, although Japanese may superficially resemble tone languages,...
متن کاملPitch Processing in Music and Speech
INTRODUCTION A highly-debated question is to what extent music and language share processing components. Beyond syntax and temporal structure processing, one studied aspect is pitchprocessing in a given domain and across domains (e.g., [1]). Pitch processing is crucial in music. For example, in Western tonal music, it is a form-bearing dimension (next to temporal structures). Pitch processing i...
متن کاملThe Prosody of Nigerian English
Nigerian English is a variety of English which has often been suggested to differ significantly from other varieties of English, especially in the area of prosody. This paper analyses the prosody of Nigerian English and compares it to the prosody of British English and three West African tone languages. Read and semi-spontaneous speech was analysed acoustically. Significant differences were fou...
متن کاملWord segmentation in Persian continuous speech using F0 contour
Word segmentation in continuous speech is a complex cognitive process. Previous research on spoken word segmentation has revealed that in fixed-stress languages, listeners use acoustic cues to stress to de-segment speech into words. It has been further assumed that stress in non-final or non-initial position hinders the demarcative function of this prosodic factor. In Persian, stress is retract...
متن کاملPreservation of lexical tones in singing in a tone language
Lexical tones are important for expressing meaning and usually have high priority in tone languages. This can create conflicts with sentence intonation in spoken language and with melodic templates in singing since all of these are transmitted by pitch. The main question in this investigation is whether a language (in our case the Mon-Khmer language Kammu) with a simple two-tone system uses sim...
متن کامل